


A METHOD FOR DESIGNING AN INITIATOR IN 



AN INTEGRATED CIRCUIT 



FIELD OF THE INVENTION 

[0001] The present invention relates to a method of designing an integrated circuit and a 
model for use in designing an integrated circuit. 

BACKGROUND OF THE INVENTION 

[0002] As integrated circuits become more complicated, it has become harder to 
translate an initial design into a silicon design. In principle it is possible for the silicon 
design to be drawn up by an individual. However in practice, this is difficult for an 
individual designer to do as there are a huge number of components on a chip. One small 
error in the design may result in a faulty integrated circuit. Additionally, this process is 
very slow and can significantly delay the amount of time taken to get the integrated 
circuit to the manufacturing stage. 

[0003] Various computer programs have been proposed to assist in the design of an 
integrated circuit and more particularly in the testing of a design. These computer 
programs have typically been in the form of digital simulators which simulate a circuit 
under test. A hardware description language (HDD has been designed to simulate and 
describe the behavior of digital circuitry. However, whilst programs such as HDL are 
usefial in testing a design, they do not assist in the design of the integrated circuit itself 

[0004] Additionally, once the higher level design of an integrated circuit has been 
completed, it can be a laborious process to obtain the gate level design which provides 
the higher level fiinction. 

SUMMARY OF THE INVENTION 

[0005] It is therefore an aim of embodiments of the present invention to provide a 
method which is able to reduce the amount of time required in order to design an 
integrated circuit. 
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[0006] According to one aspect of the present invention, there is provided a method for 
designing an initiator in an integrated circuit, said initiator being connected to an 
interconnect and arranged to issue requests, said method comprising the steps of: 

defining if the initiator or the interconnect is to be responsible for ordering 
responses to requests issued by said initiator; 

defining the maximum number of requests which are permitted to be outstanding 
at the same time; and 

defining if a delay stage is required in said initiator port. 

[0007] According to a second aspect of the present invention, there is provided a method 
for designing an interconnect having routing resources, said interconnect arranged to 
allow initiators to send requests to targets, said method comprising the steps of defining: 

the number of routing resources between the initiator and the target; 

the arbitration method for arbitrating between requests; and the association 
between the routing resources and the targets. 

[0008] According to a third aspect of the present invention there is provided a method 
for designing an interconnect having routing resources, said interconnect arranged to 
allow targets to send responses to initiators in response to requests fi-om initiators, said 
method comprising the steps of defining: 

the number of routing resources between the target and the initiator; 

the arbitration method for arbitration between responses; and the association 
between the routing resources and the initiator. 

[0009] According to a fiorther aspect of the present invention there is provided a method 
of designing an arbiter in an integrated circuit comprising initiators, targets and an 
interconnect, said arbiter being provided between said targets and said interconnect, said 
method comprising the steps of 

using an arbitration model having a plurality of different arbitration methods and 
selecting one of the plurality of arbitration methods available in said model. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0010] For a better understanding of the present invention and as to how the same may 
be carried into effect, reference will now be made by way of example to the 
accompanying drawings in which: 

[0011] Figure 1 shows a block diagram of the elements of a typical integrated circuit; 

[0012] Figure 2 shows a model for use in designing the implementation of the integrated 
circuit; 

[0013] Figure 3a shows a typical request packet structure; 

[0014] Figure 3b shows a typical response packet structure; 

[0015] Figure 4 shows the initiator side of the model of Figure 2 in more detail; 

[0016] Figure 5 shows the target side of the model of Figure 2 in more detail; and 

[0017] Figure 6 shows a block diagram illustrating the method embodying the present 
invention. 

DETAILED DESCRIPTION OF EMBODIMENTS OF THE PRESENT 
INVENTION 

[0018] Reference will be made to Figure 1 which schematically shows various 
components of an integrated circuit 2. The integrated circuit 2 has a distributed routing 
network 4. The distributed routing network 4 can be a series of dedicated connections, 
one or more shared connections or a mixture of dedicated and shared connections. One 
example of a shared connection is a bus. 

[0019] A number of modules 6 are connected to the distributed routing network 4. Any 
number of modules can be connected to the routing network 4. The modules can be of 
any form. For example the modules may include one or more of the following modules: 
CPU; external memory interface; debug module; external interface circuitry; and the 
like. 
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[0020] Each module 6 has an initiator port 8 and a target port 10. The initiator ports 8 
are arranged to output requests originating from the module 6 to the distributed routing 
network 4. The responses to the output requests are returned to the respective initiator 
port 8 via the distributed routing network 4. The target ports 10 are arranged to receive 
requests from the initiator ports 8 of other modules via the distributed network 4 and to 
output a response to the requesting module via the distributed network 4, 

[0021] Each of the initiator ports 8 and the target ports 10 are connected to a central 
control logic 12 via lines 13 which controls the distributed routing network 4 via control 
signals, 14. The central control logic arbitrates between the requests of the initiator ports 
8 to determine which one or more requests are allowed onto the distributed routing 
network at a given time (for example a given cycle). A similar arbitration frmction may 
be provided for the responses of the target ports. The requests and the response may use 
the same or different routing resources in the distributed routing network 4. 

[0022] To build a system on an integrated circuit it is necessary to link together macros 
which together define the system as a whole. Embodiments of the present invention are 
arranged to provide a macro which can be easily created and which reduce the design 
and verification resources required to build a system. The user defines the parameters for 
the model which will be described in more detail hereinafter. The resultant macro can 
then be included in the design for synthesis. The model can be customized to implement 
more sophisticated operations or to implement specific interconnect structures. 

[0023] The model described hereinafter is designed to create interconnect nodes for bus 
interconnect architectures. 

[0024] Reference is made to Figure 2 which shows a model which is used to design an 
implementation of an integrated circuit having the structure of the circuit of Figure 1 . 
The model allows the characteristics of the elements to be defined and the organization 
of the elements to be defmed. 

[0025] The model shown in Figure 2 has an initiator port 8. The initiator port includes 
one or more of the following fiinctional blocks: 

address decode block 20; 
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access queue block 22; 
dependency block 24; and 
initiator re-timer 25. 

[0026] As will be appreciated, not all embodiments of the present invention require the 
initiator port to have these functional blocks. 

[0027] The address decode block 20 is connected to the main part of the initiator port. 
The output of the address code block is connected to the dependency block 24 as is the 
output of the access queue block 22. The access queue block receives an input from the 
main part of the initiator port. The output of the dependency block is connected to the 
initiator re-timer block 25. 

[0028] The target port includes one or more of the following fimctional blocks: 

locking block 26; 
access queue block 28; 
decode block 30; and 
target re-timer 32. 

[0029] Again, not all embodiments of the present invention require the target port to 
have these functional parts. 

[0030] The decode block 30 and the access queue block 28 receive an input from the 
main part of the target port 10. The output of the decode block 30 is connected to the 
target re-timer 32, The output of the access queue block 28 is connected to the 
dependency block 24 of the initiator port. The locking block 26 and the access queue 28 
are connected to the request transport 34. The output of the locking block 26 is 
connected to the dependency block 24 of the initiator port. 

[0031] The function of the various functional blocks of the initiator and target ports will 
be described in more detail hereinafter. 
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[0032] The request transport 34 and the response transport 35 comprise the distributed 
routing network 4. For clarity, the response transport and the request transport are shown 
separately. In practice, the request transport and the response transport may be separate 
or may be, at least partially a shared resource. 

[0033] A request transport arbiter 38 is provided which arbitrates between the requests 
of the initiator ports 8 and controls which requests are allowed onto the request transport 
at a given time which may be a given clock cycle. The request arbiter is connected to the 
initiator re-timer 25 of each initiator port. A request arbiter re-timer 40 is also provided 
between the arbiter 38 and the request transport 34. 

[0034] A response transport arbiter 42 is also provided which arbitrates between the 
responses of the target ports 10 and controls which responses are allowed onto the 
response transport at a given time, for example a clock cycle. The output of each target 
re-timer 32 is connected to the arbiter 42. A response arbiter re-timer 44 is also provided 
between the response arbiter 42 and the response transport 35. 

[0035] It should be appreciated, that at least some of the elements of the model are 
optional. For example, the access queue in the initiator block may not be present or the 
access queue in the target port may not be present. 

[0036] When an integrated circuit having the structure described in relation to Figure 1 
is to be designed, the following steps are performed. These steps define the 
characteristics of the initiator and target ports, the request and response transports and 
the relationship between the request and response transport and the initiator and target 
ports. 

[0037] A number of parameters are defined. These parameters can be defined by the 
user or can be produced by a computer program or the like. These parameters describe 
the system to be built. These parameters can be stored in a file. Locations may be 
defined in data stores for the values of different parameters. These data stores may be 
arranged so that different parts of the store are associated with different types of 
parameter. These parameters can be divided into different categories: 

global parameters which relate to the entire system; 
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initiator port parameters; 

initiator port to target resource (request bus) parameters; 
target port parameters; and 

target port to initiator resource (response bus) parameters. 

[0038] The first parameter to be defined is a global parameter and applies both to the 
initiator ports and to the target ports. This parameter is data_width_k which defines the 
size of a word in the distributed routing network, that is the request transport and 
response transport of Figure 2. The sizes of the following fields are related to this 
parameter and have the following definitions: 

the size of the address of a packet in the range 3 1 : data_width_k; 

the size of the mask is in the range (2^"^>^'*^Ai):0; and 

the size of the data is in the range ((8 x 2^^^>'^*.'').l) : 0. 
[0039] The value of data width k encodes the word size as a fiinction of 2" bytes. 

[0040] The structure of typical examples of bus (transport) messages will now be 
described with reference to Figures 3a and 3b. Figure 3a shows the format of a request 
packet. The request packet has a first field of Fl of 32 bits, first 8 bits A are used by the 
request transport to identify the target thus route the packet. The remaining 24 bits B, 
which are sometimes referred to as the address, are used by the target port to identify. a 
location within an associated module or a function of that module. The second 24 bits B 
are used by the request transport in order to route the packet. 

[0041] The request packet also includes an 8 bit source field F2 which identifies source 
of the request. Jn other words, information identifying the initial port fi'om which the 
request originates is included. This 8 bit address can be the same format as the 8 bit 
address A at the head of the packet. 

[0042] The packet also has an 8 bit field F3 which identifies the type of transaction, hi 
other words, this 8 bit field contains the op-code. One of the bits of op-code field defines 
the packet as being a request packet or a response packet. For other bit positions in the 
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op-code field F3 of the request packet the size and type of the transaction are defined. 
For example, the code m define the transaction as being a read or a write transaction if 
the request packet is intended for a memory interface target or a similar target. 

[0043] The request packet also includes a transaction identifier field F4 which is 8 bits 
wide. This field is used to identify the transaction number. This allows related 
transactions to be processed in the correct order. 

[0044] The request packet may also include a data field F5, which contains data for the 
target. Only some types of request packets, such as write packets, will contain data. The 
size of the data field is defined by the data_width_k parameter. 

[0045] The response packet will now be described with reference to Figure 3b. The 
response packet does not have the same address field as a request packet but rather has 
the 8 bit source field from the request packet as its address in its first field F6. This is 
used to route the response packet back to the initiator which issued the request. The 
response packet also has a second 8 bit field F7 which includes an 8 bit opcode. One of 
the bits of this field will defme the transaction as being a response. For responses, only 
one other bit of the opcode is used and this indicates if the response is a valid response or 
an error response. 

[0046] The packet may also include in field F8 n bits of requested data for example in 
the case of a read request being issued by the requesting module. Not all response 
packets will include data. The parameter data_width_k defines the size of this field. 

[0047] Finally, the response packet also includes a transaction identity field F9 which 
provides transaction identification information. This information may allow related 
response packets to be sent consecutively on the response transport if required. 

[0048] It should be appreciated that the request and response packets shovm in Figure 3a 
and 3b may be replaced by any other suitable packet structure which may have different 
fields, additional fields or only some of the fields shown in the figures. For example the 
response and request packets may be provided with an end of packet indicator. The order 
of the fields in the packets shown in Figures 3a and 3b may be different in different 
embodiments of the present invention. It should be noted that a packet may contain more 
than one request. 
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[0049] Embodiments of the present invention can be implemented in systems where the 
requests and responses are not in a packet format. 

[0050] The parameters which are defined for the initiator ports are as follows: 

[0051] Firstly, the number of initiator ports 8 is defined. This parameter is 
initiator_number_k where k is the number of initiator ports. 

[0052] For each initiator port, the following parameters are defined. These parameters 
are stored in a hot encoded array. Each element or row of the array corresponds to a 
respective one of the initiator ports. Each position in the row stores a value of a 
predetermined parameter. 

[0053] For each initiator port, it is determined if a re-timing stage 25 is required after the 
address decode and/or dependency stages. The associated parameter is the 
initiator retime k parameter. If the value of this parameter is '0', then no re-timing stage 
25 is required and the re-timing stage 25 is effectively not there. If the value of the 
parameter is T, then this will increase the arbitration latency of the associated port by 
one cycle. In other words, each request fi-om the initiator port is delayed by one cycle 
before it is presented to the arbiter 38. This has the effect that the address functions are 
removed fi-om the critical path in the design. It should be appreciated that in alternative 
embodiments of the present invention, the delay can be selected to be more than one 
cycle. In alternative embodiments of the invention, the delay is not defined in terms of a 
clock cycle but in any other suitable manner. 

[0054] The next parameter to be set is the initiator_ordering_k which is set for each 
initiator port. If the initiator_ordering_k parameter has the value T, then the request 
transport 34 is responsible for ensuring that the requests are in the required time based 
order and that the responses to each request are received in a time-base order 
corresponding to the requests. If the parameter has the value '0*, then the initiator port is 
responsible for ensuring that the requests remain in the required order. 

[0055] For example, the order of the requests may be REQ 1, REQ 2, and REQ 3. When 
the initiator_ordering_k parameter is '1', the responses will be received by the initiator 
port in the order RESP 1, RESP 2 and RESP 3. RESP 1 is the response to request REQ 1 
and so on. If the parameter is '0', then the responses can be received in any order, for 
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example RESP 2, RESP 3 and RESP 1. The initiator port will reorder the responses to 
the correct order. 

[0056] The model is thus able to support both of these methods. Furthermore, this model 
is able to support both of these methods being used with different initiator ports in the 
same integrated circuit. This is a particularly advantageous feature of embodiments of 
the present invention. 

[0057] Another parameter which is defined for each initiator port is the 
initiator queue_depth_k parameter. This determines the maximum length or depth of the 
access queue block 22. hi particular, this defines the maximum number of requests 
which can be outstanding at a given time. A request remains outstanding until a response 
has been retumed to that request. If the maximum number of requests is outstanding, no 
fiirther requests will be accepted until one or more of the outstanding requests has 
received a response. 

[0058] The parameters which relate to the relationship between the initiator port and the 
target resource will now be described. 

[0059] The first one of these parameters is the resource_number_k which defines the 
number of routing resources available fi-om the initiator port to the target port. This value 
will be an integer, hi preferred embodiments of the present invention, a maximum of one 
resource is provided for each target fi-om each initiator. In alternative embodiments of 
the present invention it may be possible to provide more than one resource. Each routing 
resource is considered as a separate resource for arbitration and corresponds to .a 
transport path. In other words, packets of data firom the initiator port are directed to the 
target path via the transport path or routing resource. 

[0060] The resource_arbitration_k defines the arbitration model which is used by the 
arbiter 38. For example if the value of the parameter is T, then a least recently used mode 
of arbitration is used. If the parameter has the value '0', a method of arbitration may be 
used in which the initiator ports have a fixed priority. It should be appreciated that the 
arbiter is arranged to arbitrate between some or all of the requests from the initiator. 
More than two methods of arbitration may be provided. The two arbitration methods 
discussed hereinbefore are only examples and can be replaced by any one or more other 
arbitration methods. 
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[0061] The resource_retime k parameter defines whether or not the request arbiter re- 
timer 40 provides any delay or not. If the parameter has the value '1', then the re-timer 40 
delays the output of the request arbiter 38 by one cycle. This increases the arbitration 
latency for this resource and all the associated target ports by one cycle. If the parameter 
has the value '0', then the re-timer 40 provides no delay and effectively does not exist. 

[0062] The final one of these parameters is the resource_mapping_k. This parameter 
defines the association between the routing resource and the target ports for requests 
fi-om the initiator port to the target port via the request transport. For a given routing 
resource x, if a bit associated with a specific target port is set to '1', then that routing 
resource is associated with that target port. A target port is, in preferred embodiments of 
the present invention, associated with only one routing resource whilst one routing 
resource may be associated with one or more target ports. In altemative embodiments of 
the present invention, a target port may be associated with more than one routing 
resource. This parameter thus detemiines for each target port the available routing 
resource. 

[0063] The next set of parameters to be set are those relating to the target port. 

[0064] The first one of these is target_number_k which defines the number of target 
ports to be provided. Each target port is assigned a row or the like of an array. The values 
of parameters associated with each target port are stored in the assigned row of the array. 
Each parameter is associated with a predetermined position in the array. The array may 
be a hot encoded array. The array may be the same as that used to store parameters 
relating to the initiator ports. 

[0065] For each target port, the following parameters are defined- Firstly, the parameter 
target_ready_k is set. If this parameter has the value T, then it is assumed that the target 
device is always ready to receive a request. If the value is set to '0', it is assumed that the 
target device is only ready to receive a request if it has sent a ready message to the 
arbiter. This may be sent automatically or in response to a status request. 

[0066] In one embodiment of the present invention, the integrated circuit is such that the 
return addressing for the responses fi*om the targets to the initiators is not known. In 
other words, the return addressing of a response packet in response to a request packet is 
implicit. In an altemative embodiment of the present invention, the responses have an 
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address portion which identifies the initiator port corresponding to the request associated 
with the respective returned response. 

[0067] If the target _ retime _k parameter is set to T , the re-timer 32 delays the output 
of the address decode/dependency stages by a clock cycle. This will have the effect of 
increasing the arbitration latency for the particular target port by a cycle but removes the 
address fiinctions fi-om the critical path in the design. If the attribute is set to '0', then the 
target re-timer 32 does not provide a delay and is effectively not there. 

[0068] The target_ordering_k parameter is set to T if the target device is required to 
reorder the responses. This will be required if the initiator_ordering_k parameter is set to 
1 . If the target_ordering_k parameter is set to zero, the target device will not reorder the 
responses. 

[0069] The target_identity_loopback_k parameter is set to '1' if the interconnect i.e. the 
response transport 35 stores and loop backs the source and identity information 
associated with the transaction. This will then remove the requirement fi"om the target 
device. In other words, if the parameter is set to T, the response transport identifies fi*om 
where a given request has been received and will direct the response back to the, correct 
initiator. An attribute of a packet may be decoded in order to determine the address to 
which to return the response. In alternative uses of the model, the addresses will be 
stored in the queue block 28. 

[0070] The locking block is provided in those embodiments where the initiator and 
target ports are to exchange a plurality of requests and/or responses. When the locking 
mode is selected, no other initiator is permitted to send requests to the target until the 
exchange of the pluraUty of responses and requests have been completed. 

[0071] If the target_identity_loopback_k attribute is T, the user has to specify the 
number of possible outstanding requests the target device can support. This is the 
targetqueue_depthk parameter, 

[0072] The next group of parameters which need to be set relate to the path between the 
target port and the initiator port for the response packets. 



\\\BO - 80408«)080 - 151729 vl 



12 



• 



[0073] The first parameter which is set is the retum_resources_number_k parameter. 
This is an integer which defines the number of routing resources between the target port 
and the initiator port. Each routing resource is considered as a separate resource for 
arbitration and corresponds to a transport path between the target port and the initiator 
port, hi preferred embodiments of the present invention, the maximum number of 
resources available to a given target device is one. However, in alternative embodiments 
more than one resource may be available. 

[0074] The retum_resource_arbitration_k parameter defines the arbitration method used 
by the response arbiter 42. If the parameter is set to '1', the least recently used arbitration 
mode is used for arbitrating between the responses fi-om the various target ports 10. If 
the parameter is set to '0' , a simple fixed priority scheme is implemented. One of the 
target ports is defmed to have the highest priority. As with the resource_arbitration_k 
parameter described hereinbefore, other arbitration schemes may be available or 
alternatively used. 

[0075] The retum_resource_retime_k parameter is set to '1* if the response arbiter re- 
timer 44 associated with the response arbiter 42 is to provide a delay of one clock cycle. 
This will increase the arbitration latency for the response transport 35 and all the 
associated target ports 10 by one cycle. If the parameter has the value '0' then the 
response arbiter re-timer 44 does not provide any delay and is effectively not there. 

[0076] The retum_resource_mapping_k parameter defines the association between the 
routing resource of the response transport 35 and the initiator ports 8. For a given routing 
resource N, if the bit associated with the specific target is set to '1', then that routing 
resource is available for the target device to send its response packets to the initiator port. 
It should be appreciated that an initiator port may be associated with only one routing 
resource in preferred embodiments of the present invention. The roufing resource maybe 
associated with one or more targets. In altemafive embodiments of the present invention, 
more than one routing resource may be associated with an initiator. 

[0077] The following parameters define the initiator/target mapping. The 
forward_decode_f is a mapping fimction for requests from an initiator port to a target 
port. In particular, the ftinction defines the mapping between the request packets and the 
target ports. The same fimction is used by all initiator ports. However, the initiator 
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identification information (which is included in the request packet) may be used to 
customize the address map if a specific initiator device requires customization. 

[0078] The fimction provides a one hot encoded array containing one bit per target port. 
Element 1 of the array corresponds to the first target port and so on until the final 
element corresponds to the last target port. The request fi-om a given initiator will be 
mapped to a corresponding one of the target ports. The mapping is point to point which 
means that a given request fi-om a given initiator can only be mapped to a single target 
port. In alternative embodiments of the present invention, it may be possible to map a 
single request to two or more target devices. 

[0079] If the decode fimction (carried out by the decode block) is complex or a large 
number of target ports are instantiated, then the re-timer may be set to provide a one 
cycle delay. The re-timer providing the delay may be the request arbiter re-timer 40 
and/or the initiator re-timer 25. 

[0080] The retum_decode_f fimction defines the mapping between the response packets 
and the initiator device. Again, the fimction provides a one hot encoded array including 
one bit per initiator port. Element 1 of the array corresponds to the first initiator port and 
the last element corresponds to the last initiator port. Mapping is again preferably point 
to point and in preferred embodiments of the present invention, mapping a single request 
to multiple ports is forbidden. 

[0081] Again, if the decode fimction is complex or a large number of ports are 
instantiated, then it may be necessary to ensure that the target re-timer 32 and/or the 
response arbiter re-timer 44 delay the packet by at least one cycle. 

[0082] In embodiments of the present invention, any initiator port is able to 
communicate with any target port. However it is possible to define for each initiator port 
which target ports it is able to send requests. Likewise, for each target port it is possible 
to define from which initiator ports it is able to receive requests. 

[0083] In embodiments of the invention, the model permits one target port to be in 
communication with one initiator port at the same time that another target port is in 
communication with another target port. It is of course possible to use the model in the 
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situation where only one initiator is able to communicate wdth one target port at one 
time. 

[0084] In the model shown in Figure 2, the request part of the bus is shown separately 
from the response part of the bus. This model thus can be used to support a split 
transaction bus where there is a response bus and a request bus. A response can be put on 
the response bus at the same time that a request is put on the request bus. However it 
should be appreciated that the model can be adapted to the situation where the responses 
and the requests share a bus. Li this scenario, only one arbiter is provided and it will 
arbitrate between the requests and the responses to allow access to the bus. 

[0085] The model shown in Figure 2 can also be used where there are two or more 
response or request segments. In other words, if there are two request segments, then one 
request can be allowed on one request segment at the same time that another request is 
allowed on another request segment. 

[0086] The model used in embodiments of the present invention is able to have 
pipelined processing, non pipelined processing or a mix of the two. The pipelining may 
be transaction and/or arbitration pipeHning. If the processing is pipelined, the re-timing 
stage may provide a delay of one cycle. This re-timing, as shown in the model of Figure 
2 may be provided prior to and/or after arbitration. The re-timing buffer provided after 
arbitration may accommodate pipelining in or associated with the target. 

[0087] The model used in embodiments of the present invention can support ordered or 
unordered initiator ports and ordered or unordered target ports. In other words the 
initiator port can support an arrangement in which the responses have to come back in 
the same order as the corresponding requests were issued or an arrangement in which the 
responses are able to come back in any order. 

[0088] A macro is produced from the defmed parameters which connects a number of 
initiator ports to a number of target ports via an interconnect. This allows every initiator 
to communicate with every target as defmed by the address map. All available 
bandwidth to each target is allocated in dependence on the selected arbitration method. 
The maximum possible throughput is preferably maintained, up to a maximum or one 
request/response pair per cycle per interconnect resource. The following properties can 
be maintained: 
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[0089] for any initiator port, the transactions and locked transactions are maintained as 
atomic groups - that is the initiator and target are tied to each other until a transaction or 
a group or locked transactions have been completed; 

[0090] where initiators are unable to maintain an ordering internally, the interconnect 
does; 

[0091] within transactions and locked transaction sequences, ordering can be 
maintained; 

[0092] for each initiator port, ordering is maintained and it can also be assumed that 
each target also maintains ordering so that no access hazards occur, resulting from the 
incorrect order of requests or responses; 

[0093] each initiator is able to behave if it is logically independent; and 

[0094] locked sequences of transfers, that is a group of related transactions, are 
maintained together as atomic groups. 

[0095] hi one version of the model shown in Figure 2, the access queue block in the 
initiator port is omitted This model can be used if the return addressing function for the 
responses is unknown, hi other words, the return addressing of the responses is implicit 
and full ordering dependency checks are completed. These properties are achieved by 
using the queue 28 in the target device to track the status and source of every request in 
the system, create an information database to complete dependency checks and to 
perform return access mapping using tag based virtual addresses. The source information 
and the transaction identification information can be used by the target devices to reorder 
the request and responses. 

[0096] This requires that the target have storage capacity which is not appropriate in 
some integrated circuits, hi this arrangement, the addressing of the responses uses an 
internal address. 

[0097] In an alternative modification to the model shown in Figure 2, the access queue 
is not provided in the target port. The access queue is provided in the initiator port. This 
model may be cheaper to implement in that in the integrated circuit the burden for the 
transaction ordering is placed on the initiator. If the device does not require the ability to 
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pipeline transactions or is able to support more complex ordering model, the dependency 
stores may be removed. 

[0098] Reference is now made to Figure 4 which illustrates the concept of the model of 
Figure 2 in more detail. It should be appreciated that Figure 4 illustrates schematically 
the various steps carried out by the computer program to provide the resulting 
description. 

[0099] The address decoder 20 receives the following information: 

address of the target port 10 via input 100; 

the request to be sent to the target via input 102; and 

the source of the request via input 104 (this is optional). 

[0100] The address decoder 20 also receives decode information via input 106 which 
controls how the address of the target should be decoded. For example, if the address 
is address 1, then the target is target port 1, if the address is address 2, then the target 
is target port 2, and so on. The address decoder 20 has a register with one bit per 
device so that the address decoder 20 can convert the address into the internal address. 
The target is now defined by one bit. In other words the address decoder converts the 
address into a one hot encoded vector indicating the target port. The output of the 
address decoder 20 is a hot encoded vector only one bit of which has the value 1 . If 
bit n of the vector is set to '1', then the request will be routed to target n. 

[0101] The dependency block 24 looks at the defined properties of the initiator and 
targets defined by the ordering information and ensures that the routing network or 
the initiator port orders the responses. The dependency block 24 is able to ensure for 
example that an initiator port is only able to send a request to a different target when 
the initiator has received a response from the current target port to which it has sent a 
packet, if this is required. The dependency block provides a target vector C which 
defines an allowed request. The output of the dependency stage is target vector C. 

[0102] The dependency block 24 makes a decision of the allowability of a request. If 
a request would cause a hazard or a functional error, that request is not presented to 
the next stage. The information available to the dependency block includes: 
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[01031 all outstanding accesses from the associated initiator (using response 
information from the target via input 1 14); 

[0104] the ordering characteristics of the associated initiator and all the target devices 
. (via inputs 108 and 110 respectively); 

[0105| the current owner (the initiator currently commimicating) of each target. 

[0106] The dependency block 24 checks if a target port is available to an initiator port 
(via input 112). The dependency block is also arranged to check if the initiator port 
supports pipeline accesses and if the initiator port requires the request and response 
ordering to be preserved. 

[0107] The dependency block 24 also check if there is an outstanding transaction 
which could cause an ordering hazard. This uses information in the access queue 
block 22. An ordering hazard may for example occur if a second request is presented 
to a target device capable of reordering or a second request from a given initiator is 
begun on a different target device to the first request. The dependency block is 
provided with target vector B from the access queue as will be discussed in more 
detail later. 

[0108] The dependency block 24 may implement the following rules which are 
summarized in the table below: 



INITIATOR 


TARGET 


OUTSTANDING REQUESTS 


ORDERED 


ORDERED 


AS MANY AS REQUIRED, IF 
THE PREVIOUS ACCESS IS TO 
DIFFERENT DEVICE, DELAY 
UNTIL RESPONSE FROM 
PREVIOUS REQUEST HAS 
RETURNED 


ORDERED 


UNORDERED 


ONLY ONE AT A TIME 


UNORDERED 


ORDERED 


AS MANY AS REQUIRED 


UNORDERED 


UNORDERED 


AS MANY AS REQUIRED 
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[0109] This table summarizes the number of outstanding requests which are permitted 
with various combinations of ordered and unordered initiators and targets. In practice, 
the number of outstanding requests will be defined by the access queue. 

[0110] If the initiator wants access to a target which is locked to another target, that 
initiator is prevented by the dependency unit from presenting the request to another 
initiator. 

[0111] For unordered ports, the dependency unit does not contain logic connecting 
vector A to vector C. 

[0112] The queue 22 has a defined depth. If the number or requests which have not 
been serviced exceeds the defined depth, no new requests are accepted until one or 
more of the outstanding requests have been serviced. The queue is used to control the 
dependency block 24. 

[0113] The queue 22 contains a vector FIFO 150. The vector FIFO 150 contains 
enough space only to accommodate the maximum number of requests. The FIFO also 
has a portion 1 52 which indicates for each space allocated to a possible request if that 
space contains a request. This portion is connected to logic 154 which is able to 
determine if the FEFO 150 is full or not. The logic 154 provides an output, indicating 
if the FIFO is full and therefore not able to accept further requests, to the dependency 
block 24 via connection 156. 

[0114] The FIFO receives target vector A as an input via controller 158. The 
controller 158 allows target vector A to be written into the FIFO if it is allowed by the 
dependency block 24. The dependency block therefore provides the controller 158 
with a control signal not shown. 

[0115] The FIFO is connected to first logic 160 and second logic 162. The first logic 
receives a request signal, a grant signal and an end of packet signal via lines 164 to 
168. A packet may contain more than one request. This logic ensures that where the 
request and grant signals are T and the first request of a packet has been identified, 
the target associated with target vector A is put into the queue. At all other times, no 
action is taken by the first logic 160. 
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[0116] The second logic 162 receives a response request signal, a response grant 
signal and a response end of packet signal via lines 170, 172 and 174 respectively. 
Again a packet may contain more than one response. The second logic will take out a 
served request from the FIFO when all three signal are T . Thus the request is only 
removed from the FIFO when the end of the packet containing the responses is 
received. The FIFO will store in embodiments of the present invention target vector A 
associated with each response. 

[0117] The logic OR part of the access queue provides target vector B which defines 
for which targets there are currently outstanding requests. This information is output 
to the dependency block 24 which uses this information to prevent a hazard. 

[0118] If necessary, a re-timer stage 25 is connected to the output of the dependency 
stage. A control signal is provided to the re-timer via input 116 which controls the 
delay provided by the re-timer stage 25. The re-timer stage 25 has a first multiplexer 
1 80, a second multiplexer 1 82 and a buffer 1 84. The first multiplexer 1 80 receives the 
output 186 of the dependency block 24, a grounded input 188 and an output 190 of 
the buffer 184. The control signal for the first multiplexer is provided by a switch 192 
which selects one of the request, grant and end of packet signals as a control signal. 
The switch 192 thus controls the multiplexer to let the appropriate input to the first 
multiplexer 1 80 therethrough. 

[0119] The buffer 184 receives the output of the first muUiplexer 180. The buffer 184 
provides the delay. 

[0120] The second multiplexer 182 receives the output 190 of the buffer 184 and the 
input target vector C from the dependency block 24. If a delay is required, the control 
signal from line 116 causes the output of the buffer 184 to be output 190 by the 
second multiplexer 1 82 and if no delay is required, the second multiplexer outputs the 
target vector C which has bypassed the buffer 184. The buffer 184 provides a one 
cycle delay if required. 

[0121] The re-timer circuit provides an additional lock to stop request ghost 
duplication from occurring. This can occur where a packet -contains more than one 
request so that more than one response is received to a given packet. A one cycle 
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delay is inserted between subsequent requests. The output of the re-timer stage is 
target vector C which is delayed if required. 

[0122] The output of the re-timer stage 25 is connected a reorganization part 194 
which implements the following: 

[0123] resource vector D = (resource/target mapping) x (target vector C) 

[0124] The resource vector D is thus output from the reorganization part 194 which 
takes into account the mapping between the resource (interconnect) and the target and 
the target vector C. The reorganization part 194 does this for a number of target 
vectors C associated with different initiators. 

[0125] The arbiter 38 makes an arbitration decision based on each packet of 
information. The arbiter uses the following information to make the arbitration 
decision: 

the initiator making the request; the number of outstanding requests (via input 
120) (this is optional); 

the availability of the target (via input 122); the arbiter makes a decision once per 
packet; and the arbitration method (defined by input 118). 

[0126] The arbiter 38 comprises a packet control block 196 which is arranged to 
maintain packet integrity. In order to perform this function, the packet control block 
196 receives the request, grant and end of packet signals. The output of the packet 
control block is connected to the arbitration algorithm part which also receives the 
arbitration method signal via line 118, the availability of the target via input 122 and 
the number of outstanding requests via input 120. The arbitration algorithm part also 
receives the requests from the various initiators output by the reorganization part 194. 

[0127] The arbiter 38 also includes a table 200 which defines the priority of the 
initiators. The initiator with the highest priority is at the top of the table followed by 
the initiator with the next highest priority and so on. 

[0128] The arbitration algorithm used the priority information from the table to allow 
the initiator with the highest priority which cam make an allowable request to win 
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access to interconnect. The algorithm also decides the priority of the initiators for the 
next decision and changes the order of the initiators so that the highest priority 
initiator is at the top of the table. The order of the initiators is determined in 
accordance with the selected arbitration algorithm. The table thus provides some 
information regarding the previous history. 

[0129] The output of the arbiter 38 is thus one request which is resource vector E 
which represents a request from the initiator which has won the arbitration. The 
output of the arbiter is input to a re-mapper 202 which implements the following 
function: 

target vector = (resource/target mapping) -1 (resource vector) 

[0130] A delay stage having the same structure as delay stage 25 may be provided 
here. 

[0131] The request transport 34 accepts information indicating which initiator is to be 
connected to which target. The number of connections possible per cycle is 
determined by the number of connect resources available. In simple systems, this may 
be a single shared bus for all target devices or in high performance systems a fully 
connected cross bar may be provided. 

[0132] If the locking stage 26 is provided, the locked request is passed to a target 
device. That information is stored here until that lock is released by a second request 
packet. 

[0133] Reference will now be made to Figure 5 which shows the target side of the 
model of Figure 2 in more detail. 

[0134] The address decoder 30 is similar to the address decoder 30 on the initiator 
side and receives the information identifying the target on input 210, the return source 
212 for the response and the output 214 of the access queue 28. This provides an 
output vector which indicates the return address for the response. The output vector 
takes the same format as that output by the address decoder 20. 

[0135] The access queue 28 stores information on what requests have been made and 
to which initiator that request is associated. This is used to address response packets 
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back to the initiator port. If the target port is able to return responses out of order, then 
items are removed from this queue using an associated match on the source and 
transaction identifier fields of the request. If the target port is not able to return 
responses out of order, a FIFO can be used. The access queue also provides an output 
to the dependency block. 

[0136] The decode block 30 may convert tag information stored in the access queue 
into a local response address. 

[0137] The locking block 26 has a bit which indicates if the associated target is 
locked to a particular initiator and if so the identity of that initiator. This provides an 
output v/hich is used by the dependency block 24. 

[0138] The output of address decode block is input to re-timer 32 v^hich has the same 
structure as re-timer 25 shown in Figure 4. The output of the re-timer is input to a 
reorganization part 220 which has the same structure reorganization part 194 and 
which performs the same function. The output of the reorganization part 194 is 
connected to the input of the target arbiter 42. 

[0139] The target arbiter 42 makes the arbitration decision based on the following 
information: 

the target requesting access to the transport resource; 
the availability of the initiator; 
the arbitration method; and 

makes the decision once per packet. 

[0140] The target arbiter has the same structures as the arbiter 38 of Figure 4. A re- 
timer stage may be provided after the target arbiter having the same structure as re- 
timer 25 of Figiu-e 4. A re-mapper 222 is provided which is similar to the re-mapper 
discussed in relation to Figure 4. 

[0141] The response transport 35 accepts a two dimensional array of information 
defining which initiator is connected to which target. The number of connections 
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which are possible in a given cycle are determined by the number of connect 
resources available. This may be a shared bus or a fully connected cross bar. 

[0142] The nature of the request and response transport will be determined based on 
the number of initiators and the number of targets. 

[0143] The arrangement shown in Figures 4 and 5 is able to deal with posted 
accesses. In the vector stored in the access queue, no bits are set for posted access. 
However, this posted access request will be stored. A false response to that request is 
generated and passed back to the access queue via the target interconnect. This false 
response causes the request to be removed from the queue. This false response is 
provided before any true response can be provided. The interconnect also treats the 
false response as the response. To deal with the subsequent true response a checker 
can be provided between the target interconnect and the initiator which removes the 
true response. The true response is thus not presented to the access queue or to the 
initiator. 

[0144] The initiator uses posted access when it is not interested in knowing when a 
request has been performed. The initiator will assume that the request has been acted 
on correctly. This can be used to achieve pipelining if a response to a request has to be 
received before a further request is permitted. By using the false response, the system 
can be speeded up. 

[0145] It should be appreciated that some of the structures used in the model can 
advantageously also be used in the final integrated circuit which is manufactured. 
However the options which are provided in the model will not generally be present in 
the final product. For example, the integrated circuit can use the arbiter structure 
described hereinbefore but the arbiter will be programmed with a single arbitration 
method only. The delay structure will either be present or absent. If present the bypass 
path is not provided. Other blocks may not require modification when implemented in 
the integrated circuit. 

[0146] The model described hereinbefore permits a RTL (Register to transfer level) 
description of the integrated circuit to be obtained. This is a gate level description of 
the system. 
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[0147] Alternatively or additionally, this model can be used to obtain a functional 
description of the system. The functional description will describe how each of the 
elements of the integrated circuit behave but will not have the timing of the final 
integrated circuit. 

[0148] Alternatively or additionally, this model can be used to obtain a performance 
description of the system. The performance description of the system will include a 
functional description of the integrated circuit and will include the timing of the final 
integrated circuit. 

[0149] The performance and functional descriptions of the system are higher level 
descriptions of the circuit. These descriptions can be used by other computer 
programs such as C so as to generate the gate level design of the integrated circuit. 

[0150] hi one embodiment of the invention, the modeling method described 
hereinbefore is performed a number of times. Each time the method is performed, 
more parameters are defined or introduced- This allows high level problems to be 
more quickly identified. 

[0151] Reference will now be made to Figure 6 which shows how the method is 
implemented. The values of the parameters described previously are defined and are 
input to the model of Figure 2 which has been defined in the vhdl or similar language. 
The output of the vhdl program provides a computer net list. This provides a 
description of the integrated circuit. 

[0152] It should be appreciate that the various parameters can be set in any suitable 
order. In the example described hereinbefore, certain values of the parameters are 
defined as meaning certain specific things. The certain values of the parameters can of 
course represent something different. 

[0153] In summary, the model described hereinbefore can provide the following 
features: 

support for the generation of interconnect routing elements of n x m where n is the 
number of initiator ports and m is the number of target ports and m and n are 
integers equal to or greater than 1 ; 
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creation of a fully connected non-blocking network supporting up to 2 transfers 
per target per cycle. A non-blocking network is one in which a first initiator can 
communicate with a first target at the same time that a second initiator can 
communicate with a second target; 

the selection of the transaction ordering mode for each initiator, that is whether the 
ordering is enforced by the initiator or the transport; 

the selection of direct or pipelined address (that is whether or not a re-timing stage 
is present) and dependency processing for each initiator port; 

the selection of direct or pipelined arbitration processing for each target port (that 
is whether or not the arbitration re-timing stage is present); 

the selection of the mode of arbitration for each routing element; and 

support for locked transaction sequences. 

[0154) The resultant macro of embodiments of the present invention is a fully 
synthesisable vhdl module which is referenced directly in the final design. It is 
possible to define a macro for different sub blocks of the design. Each sub block may 
be fiiUy defined and independent and may be used to create a design specific 
implementation. 

[0155] The interconnect used for testing purposes may be in the form of an AND/OR 
tree or be tristate or functional multiplexing. 
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